STD and SCR Techniques and Their Evaluations on the NTCIR-10 SpokenDoc-2 Task
نویسندگان
چکیده
This paper describes spoken term detection (STD) and spoken contents retrieval (SCR) techniques and their evaluations at the NTCIR-10 SpokenDoc-2 task. First of all, we describes our STD technique using a phoneme transition network (PTN) derived from multiple speech recognizers’ outputs and its evaluations at the STD and the iSTD (inexistent STD) tasks. Next, we introduce our SCR technique using Web documents for expanding the target spoken documents. It is evaluated on the two SCR tasks.
منابع مشابه
Spoken Document Retrieval Experiments for SpokenDoc-2 at Ryukoku University (RYSDT)
In this paper, we describe spoken document retrieval systems in Ryukoku University, which were participated in NTCIR-10 IR for Spoken Documents (“SpokenDoc-2”) task. In NTCIR-10 “SpokenDoc-2” task, there are two subtasks: “spoken term detection (STD) subtask” and “ad-hoc spoken content retrieval (SCR) subtask”. We participated in the SCR subtask as team RYSDT. In this paper, our SCR systems are...
متن کاملDTW-Distance-Ordered Spoken Term Detection and STD-based Spoken Content Retrieval: Experiments at NTCIR-10 SpokenDoc-2
In this paper, we report our experiments at NTCIR-10 SpokenDoc-2 task. We participated both the STD and SCR subtasks of SpokenDoc. For STD subtask, we applied novel indexing method, called metric subspace indexing, previously proposed by us. One of the distinctive advantages of the method was that it could output the detection results in increasing order of distance without using any predefined...
متن کاملSpoken Term Detection and Spoken Content Retrieval: Evaluations on NTCIR 11 SpokenQuery&Doc Task
In this paper, we report out experiments on NTCIR-11 SpokenDoc&Query task for spoken term detection (STD) and spoken content retrieval (SCR). In STD, we consider acoustic feature similarity between utterances over both word and sub-word lattices to deal with the general problem of open vocabulary retrieval with queries of variable length. In SCR, we modify term frequency using expected term fre...
متن کاملOverview of the NTCIR-10 SpokenDoc-2 Task
This paper describes an overview of the IR for Spoken Documents Task in NTCIR-10Workshop. In this task, the spoken term detection (STD) subtask and ad-hoc spoken content retrieval subtask (SCR) are conducted. Both of the tasks target to search terms, passages and documents included in academic oral presentations. This paper explains the data used in the tasks, how to make transcriptions by spee...
متن کاملSpoken Document Retrieval Experiments for SpokenDoc at Ryukoku University (RYSDT)
In this paper, we describe spoken document retrieval systems in Ryukoku University, which were participated in NTCIR-9 IR for Spoken Documents (“SpokenDoc”) task. In NTCIR-9 “SpokenDoc” task, there are two subtasks: “Spoken term detection (STD) subtask” and “Spoken document retrieval (SDR) subtask”. We participated in the both subtasks as team RYSDT. In this paper, first, our STD systems are de...
متن کامل